Handling Large Workloads by Profiling and Clustering
Abstract
View materialization is recognized as one of the most effective ways to increase Data Warehouse performance; nevertheless, because the techniques for choosing the best set of views to materialize are computationally complex, this task is mainly carried out manually when large workloads are involved. In this paper we propose a set of statistical indicators that the designer can use to characterize the workload of the Data Warehouse, thus driving the logical and physical optimization tasks; furthermore, we propose a clustering algorithm that reduces the cardinality of the workload and uses these indicators to measure the quality of the reduced workload. Using the reduced workload as the input to a view materialization algorithm allows large workloads to be handled efficiently.
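The idea of reducing workload cardinality by clustering can be illustrated with a minimal sketch. It is not the algorithm proposed in the paper, whose details the abstract does not give. The sketch assumes that each query is described only by its set of group-by attributes, that similarity is measured with the Jaccard index, and that a cluster is represented by the union of its members' attributes. The names `jaccard`, `reduce_workload`, and `threshold` are hypothetical.

```python
from typing import FrozenSet, Iterable, List, Tuple

Query = FrozenSet[str]  # assumption: a query is just its set of group-by attributes


def jaccard(a: Iterable[str], b: Iterable[str]) -> float:
    """Jaccard similarity between two attribute sets (1.0 if both are empty)."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def reduce_workload(queries: List[Query], threshold: float = 0.5) -> List[Tuple[Query, int]]:
    """Greedy one-pass clustering (illustrative only, not the paper's method).

    Each query joins the most similar existing cluster whose similarity is at
    least `threshold`; otherwise it starts a new cluster. A cluster is
    represented by the union of its members' group-by attributes, together
    with the number of queries it absorbed.
    """
    clusters: List[list] = []  # each entry is [representative attribute set, query count]
    for q in queries:
        best, best_sim = None, threshold
        for cluster in clusters:
            sim = jaccard(q, cluster[0])
            if sim >= best_sim:
                best, best_sim = cluster, sim
        if best is None:
            clusters.append([set(q), 1])
        else:
            best[0] |= q   # widen the representative to cover the new query
            best[1] += 1
    return [(frozenset(rep), count) for rep, count in clusters]


# Hypothetical four-query workload over made-up attribute names.
workload = [
    frozenset({"month", "store"}),
    frozenset({"month", "store", "product"}),
    frozenset({"year"}),
    frozenset({"year"}),
]
reduced = reduce_workload(workload)
print(len(workload), "queries reduced to", len(reduced), "representatives")
```

The reduced list, with counts used as weights, is what would be handed to a view materialization algorithm in place of the full workload. Using the union of group-by attributes as the representative is a design choice of this sketch: a view grouped at that finer level can typically answer each member query by further aggregation, at the cost of a larger view.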
Similar resources
Genetic Relationships among Three Yarrow Species Based on Phenotypic Traits and Peroxidase Profiling
Fifteen yarrow populations from three species, Achillea millefolium L., A. biebersteinii L. and A. nobilis, collected from different geographical areas of Iran, were studied using 24 morphological traits and peroxidase profiles. Comparison of mean values of different phenotypic traits shows that A. millefolium and A. biebersteinii L. had higher plant height and crown diameter; however, A. nobilis had higher ...
Expression Profiling of Microarray Gene Signatures in Acute and Chronic Myeloid Leukaemia in Human Bone Marrow
Background: Classification of cancer subtypes by means of microarray signatures is becoming increasingly difficult to ignore as a potential way to transform pathological diagnosis; nonetheless, measurement of indicator genes in routine practice appears to be arduous. In a previously published study, we utilized real-time PCR measurement of indicator genes in acute lymphoid leukaemia (ALL) and acute m...
OPTIMIZATION OF FUZZY CLUSTERING CRITERIA BY A HYBRID PSO AND FUZZY C-MEANS CLUSTERING ALGORITHM
This paper presents an efficient hybrid method, namely fuzzy particle swarm optimization (FPSO) and fuzzy c-means (FCM) algorithms, to solve the fuzzy clustering problem, especially for large sizes. When the problem becomes large, the FCM algorithm may result in uneven distribution of data, making it difficult to find an optimal solution in a reasonable amount of time. The PSO algorithm does find ago...
FlexSplit: A Workload-Aware, Adaptive Load Balancing Strategy for Media Cluster
A number of technology and workload trends motivate us to consider a new request distribution and load balancing strategy for streaming media clusters. First, in emerging media workloads, a significant portion of the content is short and encoded at low bit rates. Additionally, media workloads display strong temporal and spatial locality. This makes modern servers with gigabytes of main memory ...
Bluetooth protocol profiling on the Xilinx Virtex II Pro
Nowadays, there is an increasingly strong trend to integrate a multitude of functionalities into a single device. Traditionally, this has been achieved by utilizing more powerful general-purpose processors to handle the additional workload. Later, application-specific processors (acting as co-processors or hardware accelerators) were introduced to offload part of these workloads and to m...